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April 2003 Proceedings of the seventh annual international conference on 
Computational molecular biology 

Full text available: ^ pdf(212.43 KB) Additional Information: full citation , abstract , references , index terms 

It is shown that structural similarity between proteins can be decided well with much less 
information than what is used in common similarity measures. The full Ca representation 
contains redundant information because of the inherent chain topology of proteins and a 
limit on their compactness due to excluded volume. A wavelet analysis on random chains 
and proteins justifies approximating subchains by their centers of mass. For not too 
compact chain-like structures in general, and ... 
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Shape similarity is one of the most elusive and intriguing questions of nature and 
mathematics. Proteins provide a rich domain in which to test theories of shape similarity. 
Proteins can match at different scales and in different arrangements. Sometimes the 
detection of common local structure is sufficient to infer global alignment of two proteins; at 
other times it provides false information. Proteins with very low sequence identity may 
share large substructures, or perhaps just a central core. ... 

Keywords: homology, homotopy, knot theory, protein structure, robot motion planning, 
writhing 
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Traditional approaches for content-based image querying typically compute a single 
signature for each image based on color histograms, texture, wavelet tranforms etc., and 
return as the query result, images whose signatures are closest to the signature of the 
query image. Therefore, most traditional methods break down when images contain similar 
objects that are scaled differently or at different locations, or only certain regions of the 
image match. In this pape ... 

44 Studying users: A graph-based recommender system for digital library Q 
Zan Huang, Wingyan Chung, Thian-Huat Ong, Hsinchun Chen 

July 2002 Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries 

Additional Information: full citation , abstract , references , citings , index 



Full text available: ' ^ 
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Research shows that recommendations comprise a valuable service for users of a digital 
library [11]. While most existing recommender systems rely either on a content-based 
approach or a collaborative approach to make recommendations, there is potential to 
improve recommendation quality by using a combination of both approaches (a hybrid 
approach). In this paper, we report how we tested the idea of using a graph-based 
recommender system that naturally combines the content-based and collaborative ... 

Keywords: Hopfield net algorithm, Chinese phrase extraction, collaborative filtering, 
content-based filtering, graph-based model, mutual information algorithm, recommender 
system 
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Yang Chu, Liang-Tien Chia, Sourav S. Bhowmick 

November 2004 Proceedings of the 2nd ACM international workshop on Multimedia 
databases 

Full text available: ^ pdf(279.92 KB) Additional Information: full citation , abstract , references , index terms 

MPEG-7 documents, which are primarily for multimedia information exchange, are also 
data-centric XML documents. Due to its advantages, the relational DBMS is the best choice 
for storing such XML documents. Storing XML data in relational DBMS can be classified into 
two classes of storage model: structure-mapping and model-mapping. However, the 
structure-mapping model cannot support complex Xpath-based query efficiently and model 
mapping approach lacks the flexible capability in representing al ... 

Keywords: MPEG-7, SM3, relational DBMS, storing XML documents 
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Sid-Ahmed Berrani, Laurent Amsaleg, Patrick Gros 

November 2003 Proceedings of the twelfth international conference on Information and 
knowledge management 

Full text available" Wi df(154 57 KB) Additional Information: full citation , abstract , references , citings , index 
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It is known that all multi-dimensional index structures fail to accelerate content-based 
similarity searches when the feature vectors describing images are high-dimensional. It is 
possible to circumvent this problem by relying on approximate search-schemes trading-off 
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result quality for reduced query execution time. Most approximate schemes, however, 
provide none or only complex control on the precision of the searches, especially when 
retrieving the k nearest neighbors (NNs) of query poi ... 

Keywords: approximate nearest-neighbor searches, multimedia databases, similarity 
searches 
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computation of product recommendations 
Cai-Nicolas Ziegler, Georg Lausen, Lars Schmidt-Thieme 

November 2004 Proceedings of the Thirteenth ACM conference on Information and 
knowledge management 

Full text available: ^ pdf(269.57 KB) Additional Information: full citation , abstract , references, index terms 

Recommender systems have been subject to an enormous rise in popularity and research 
interest over the last ten years. At the same time, very large taxonomies for product 
classification are becoming increasingly prominent among e-commerce systems for diverse 
domains, rendering detailed machine-readable content descriptions feasible. Amazon.com 
makes use of an entire plethora of hand-crafted taxonomies classifying books, movies, 
apparel, and various other goods. We exploit such taxonomic backg ... 

Keywords: machine learning, recommender systems, taxonomies 



48 Information filtering: the computation of similarities in large corpora of legal texts Q 
Erich Schweighofer, Werner Winiwarter, Dieter Merkl 

May 1995 Proceedings of the 5th international conference on Artificial intelligence 
and law 

Full text available: pdf(748.19 KB) Additional Information: full citation , references , citings , index terms 
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Thomas Funkhouser, Michael Kazhdan, Philip Shilane, Patrick Min, William Kiefer, AyelletTal, 
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August 2004 ACM Transactions on Graphics (TOG), Volume 23 issue 3 
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In this paper, we investigate a data-driven synthesis approach to constructing 3D geometric 
surface models. We provide methods with which a user can search a large database of 3D 
meshes to find parts of interest, cut the desired parts out of the meshes with intelligent 
scissoring, and composite them together in different ways to form new objects. The main 
benefit of this approach is that it is both easy to learn and able to produce highly detailed 
geometric models — the conceptual design for ne ... 

Keywords: 3D shape matching, databases of geometric models, interactive modeling tools 



50 Image retrieval: Content representation and similarity matching for texture-based 
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Noureddine Abbadeni 

November 2003 Proceedings of the 5th ACM SIGMM international workshop on 
Multimedia information retrieval 

Full text available: *p 3 pdf(409.30 KB) Additional Information: full citation , abstract , references , index terms 
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This paper addresses the fundamental issues of visual content representation and similarity 
matching in content-based image retrieval and image databases in general. Simply stated, 
defining an image retrieval system is equivalent to find answers to two fundamental 
questions: 1. Representation model or which features are used to represent the content of 
images; 2. Once the set of features representing the content of images is determined, the 
question of how to combine the individual or partial si ... 

Keywords: auto regressive model, content-based image retrieval, perceptual evaluation, 
perceptual model 



51 Database theory, technology and applications (DTTA): Integrating similarity-based 
queries in image DBMSs 

Solomon Atnafu, Richard Chbeir, David Coquil, Lionel Brunie 

March 2004 Proceedings of the 2004 ACM symposium on Applied computing 

Full text available: ^ pdf(381.23 KB) Additional Information: full citation , abstract , references , index terms 

Until recently, issues in image retrieval have been handled in DBMSs and in computer vision 
as separate research works. Nowadays, the trend is towards integrating the two approaches 
(content- and metadata-based) for multi-criteria image retrieval. However, most existing 
works and proposals in this domain lack a formal framework to deal with a multi-criteria 
query. In this paper, we introduce a formal framework to address this subject of image 
retrieval under an ORDBMS model. We first propose an ... 

Keywords: image DBMS, multi-criteria retrieval, multimedia algebra 



52 Social browsing: Studying the effect of similarity in online task-focused interactions Q 
Dan Cosley, Pamela Ludford, Loren Terveen 

November 2003 Proceedings of the 2003 international ACM SIGGROUP conference on 
Supporting group work 



Although the Internet provides powerful tools for social interactions, many tasks-for 
example, information-seeking-are undertaken as solitary activities. Information seekers are 
unaware of the invisible crowd traveling in parallel to their course through the information 
landscape. Social navigation systems attempt to make the invisible crowd visible, while 
social recommender systems try to introduce people directly. However, it is not clear 
whether users desire or will respond to social cues ind ... 

Keywords: community, demographics, friendship, matchmaking, recommender systems, 
similarity, social navigation 



Poster papers: Similarity measure based on partial information of time series 
Xiaoming Jin, Yuchang Lu, Chunyi Shi 

July 2002 Proceedings of the eighth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Full text available: ^pdf(704.61 KB) Additional Information: full citation , abstract , references , index terms 

Similarity measure of time series is an important subroutine in many KDD applications. 
Previous similarity models mainly focus on the prominent series behaviors by considering 
the whole information of time series. In this paper, we address the problem: which portion 
of information is more suitable for similarity measure for the data collected from a certain 
field. We propose a model for the retrieval and representation of the partial information in 
time series data, and a methodology for evaluat ... 
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54 Using extended feature objects for partial similarity retrieval 
Stefan Berchtold, Daniel A. Keim, Hans-Peter Kriegel 

November 1 997 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 6 Issue 4 
Full text available: ^ pdf(679.89 KB) Additional Information: full citation , abstract , citings , index terms 

In this paper, we introduce the concept of extended feature objects for similarity retrieval. 
Conventional approaches for similarity search in databases map each object in the database 
to a point in some high-dimensional feature space and define similarity as some distance 
measure in this space. For many similarity search problems, this feature-based approach is 
not sufficient. When retrieving partially similar polygons, for example, the search cannot be 
restricted to edge sequences, since simil ... 

Keywords: CAD databases, Fourier transformation, Indexing and query processing of 
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Motivated by the urgent need to improve the efficiency of similarity queries, approximate 
similarity retrieval is investigated in the environment of a metric tree index called the M- 
tree. Three different approximation techniques are proposed, which show how to forsake 
query precision for improved performance. Measures are defined that can quantify the 
improvements in performance efficiency and the quality of approximations. The proposed 
approximation techniques are then tested on various synthet ... 

Keywords: Access structures, Approximation algorithms, Distance only data, Performance 
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The notion of similarity between objects finds use in many contexts, for example, in search 
engines, collaborative filtering, and clustering. Objects being compared often are modeled 
as sets, with their similarity traditionally determined based on set intersection. Intersection- 
based measures do not accurately capture similarity in certain domains, such as when the 
data is sparse or when there are known relationships between items within sets. We 
propose new measures that exploit a hierarchical ... 
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Similarity based retrieval from sequence databases is of importance in many applications 
such as time-series, video and textual databases. In this paper, automata based formalisms 
are introduced for specifying queries over such databases. Various measures defining the 
distance of a database sequence from an automaton are defined. Efficient methods for 
similarity based retrieval are presented for each of the distance measures. These methods 
answer nearest neighbor queries (i.e. retrieval of k do ... 
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The integration of distributed, heterogeneous databases, such as those available on the 
World Wide Web, poses many problems. Herer we consider the problem of integrating data 
from sources that lack common object identifiers. A solution to this problem is proposed for 
databases that contain informal, natural-language "names" for objects; most Web-based 
databases satisfy this requirement, since they usually present their information to the end- 
user through a veneer of text. We des ... 
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A framework for retrieving images by spatial similarity (FRISS) in ima ge databases is 
presented. In this framework, a robust retrieval by spatial similarity (RSS) algorithm is 
defined as one that incorporates both directional and topological spatial constraints, 
retrieves similar images, and recognized images even after they undergo translation, 
scaling, rotation (both perfect and multiple), or any arbitrary combination of 
transformatioins. The FRISS framework is discussed and used as a ba ... 
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